CDS

Accession Number TCMCG042C37368
gbkey CDS
Protein Id XP_016469504.1
Location complement(join(17312..17338,17438..17519,17643..17721,17979..18048,18138..18260,18346..18398,18502..18561,19356..19560,19655..19736,19835..19970,20398..20428))
Gene LOC107791873
GeneID 107791873
Organism Nicotiana tabacum

Protein

Length 315aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016614018.1
Definition PREDICTED: inositol oxygenase 2-like [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category S
Description Myo-inositol oxygenase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01184        [VIEW IN KEGG]
KEGG_rclass RC00465        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K00469        [VIEW IN KEGG]
EC 1.13.99.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00053        [VIEW IN KEGG]
ko00562        [VIEW IN KEGG]
map00053        [VIEW IN KEGG]
map00562        [VIEW IN KEGG]
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0016491        [VIEW IN EMBL-EBI]
GO:0016701        [VIEW IN EMBL-EBI]
GO:0050113        [VIEW IN EMBL-EBI]
GO:0055114        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGACTTTCCTCGTTGCCCAAACTGAGCATGGAGCAGAAATTGAGAACAAGAAGGTAACTACTGATGCTGAGGAATTGTTTCTTGATGGTGGATTTGTTGTGCCAAAAAATGTTTCAAACGATGGATTTGTTGTTCCTGGCAACAATGCATTTGGCAACTCATTCAGGGATTATAGTGCAGAAACTGATCGGAAAAAGATCGTGGAGGAACTCTATCGACAGAGCCACATTAACCAAACATATGATTTTGTGAAAAAGATGAGGGAAGAGTATGGAAAGTTGGATAAAGTTGAGATGAGCATTTGGGAAAGTTGTGAACTTTTAAATGAAGTTGTGGATGATAGTGATCCTGATTTGGATGAACCCCAAATTCAGCATTTGTTGCAAACTGCTGAAGCTATTAGGAAAGACTATCCTAATGAAGATTGGTTGCATTTGACTGCCCTTATTCATGATCTTGGCAAAGTTCTTCTGCTTCCTAGCTTTGGAGGGCTTCCTCAGTGGGCTGTTGTTGGTGACACATTCCCCCTTGGTTGTGCTTTTGATGAATCAAATGTGCTTTATGAGCAATTTAAGGGAAATCCTGATTACAACAATCCATCTTACAACACAAAAAATGGAGTTTATTCTAAAGGATGTGGGCTTGATAATGTGGTTATGTCTTGGGGACATGATGACTACATGTATTTGGTTGCTAAGGAAAACAAAACTACTCTGCCATCTGCTGCATTGTTCATCATCCGATACCATTCCTTTTATCCTCTGCACAGGGCAGGGGCATATAAACACTTGATGAATGAGGAGGATCATGAAAATCTGAAATGGCTTCATATTTTCAACAAATATGATCTTTATAGCAAGAGCAGAGTTTTGATTGATGTGGAGAAAGTGAAACCTTACTACATTTCCCTCATTGAGAAGTATTTCCCAGCAAAGCTGAAGTGGTGA
Protein:  
MTFLVAQTEHGAEIENKKVTTDAEELFLDGGFVVPKNVSNDGFVVPGNNAFGNSFRDYSAETDRKKIVEELYRQSHINQTYDFVKKMREEYGKLDKVEMSIWESCELLNEVVDDSDPDLDEPQIQHLLQTAEAIRKDYPNEDWLHLTALIHDLGKVLLLPSFGGLPQWAVVGDTFPLGCAFDESNVLYEQFKGNPDYNNPSYNTKNGVYSKGCGLDNVVMSWGHDDYMYLVAKENKTTLPSAALFIIRYHSFYPLHRAGAYKHLMNEEDHENLKWLHIFNKYDLYSKSRVLIDVEKVKPYYISLIEKYFPAKLKW